Continuous-Space Language Models for Statistical Machine Translation
Similar resources
Continuous-Space Language Models for Statistical Machine Translation
This paper describes an open-source implementation of the so-called continuous space language model and its application to statistical machine translation. The underlying idea of this approach is to attack the data sparseness problem by performing the language model probability estimation in a continuous space. The projection of the words and the probability estimation are both performed by a mul...
Continuous Space Translation Models for Phrase-Based Statistical Machine Translation
This paper presents a new approach to perform the estimation of the translation model probabilities of a phrase-based statistical machine translation system. We use neural networks to directly learn the translation probability of phrase pairs using continuous representations. The system can be easily trained on the same data used to build standard phrase-based systems. We provide experimental e...
Converting Continuous-Space Language Models into N-Gram Language Models for Statistical Machine Translation
Neural network language models, or continuous-space language models (CSLMs), have been shown to improve the performance of statistical machine translation (SMT) when they are used for reranking n-best translations. However, CSLMs have not been used in the first pass decoding of SMT, because using CSLMs in decoding takes a lot of time. In contrast, we propose a method for converting CSLMs into b...
Investigating Continuous Space Language Models for Machine Translation Quality Estimation
We present novel features designed with a deep neural network for Machine Translation (MT) Quality Estimation (QE). The features are learned with a Continuous Space Language Model to estimate the probabilities of the source and target segments. These new features, along with standard MT system-independent features, are benchmarked on a series of datasets with various quality labels, including p...
Large, Pruned or Continuous Space Language Models on a GPU for Statistical Machine Translation
Language models play an important role in large vocabulary speech recognition and statistical machine translation systems. The dominant approach for several decades has been back-off language models. Some years ago, there was a clear tendency to build huge language models trained on hundreds of billions of words. Lately, this tendency has changed and recent work concentrates on data selection. Con...
Journal
Journal title: The Prague Bulletin of Mathematical Linguistics
Year: 2010
ISSN: 1804-0462,0032-6585
DOI: 10.2478/v10108-010-0014-6